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ABSTRACT 

RNA dimerization is an essential step in the retro- 
viral life cycle. Dimerization and encapsidation 
signals, closely linked in HIV-2, are located in the 
leader RNA region. The SL1 motif and nucleocapsid 
protein are considered important for both pro- 
cesses. In this study, we show the structure of the 
HIV-2 leader RNA (+1-560) captured as a loose 
dimer. Potential structural rearrangements within 
the leader RNA were studied. In the loose dimer 
form, the HIV-2 leader RNA strand exists in vitro as 
a single global fold. Two kissing loop interfaces 
within the loose dimer were identified: SL1/SL1 
and TAR/TAR. Evidence for these findings is pro- 
vided by RNA probing using SHAPE, chemical 
reagents, enzymes, non-denaturing PAGE mobility 
assays, antisense oligonucleotides hybridization 
and analysis of an RNA mutant. Both TAR and SL1 
as isolated domains are bound by recombinant 
NCp8 protein with high affinity, contrary to the 
hairpins downstream of SL1. Foot-printing of the 
SL1/NCp8 complex indicates that the major binding 
site maps to the SL1 upper stem. Taken together, 
these data suggest a model in which TAR hairpin III, 
the segment of SL1 proximal to the loop and the PAL 
palindromic sequence play specific roles in the 
initiation of dimerization. 

INTRODUCTION 

Retroviruses selectively encapsidate a dimeric RNA 
genome assembled from two identical positive sense 
strands interacting near their 5'-ends. It is presumed that 
in the case of HIV-2, co-translational capture of the 
full-length RNA by the nucleocapsid (NC) domain of 
Gag ensures encapsidation specificity (1). The RNA 
motifs important for interaction with the Gag protein 



are located in the leader RNA region (5'-UTR) that 
contains highly structured domains which play various 
regulatory roles in viral replication (2). 

The essential motif for HIV-1 dimerization has been 
extensively studied and is called the dimerization initiation 
site (DIS) (3) or stem-loop 1 SL1 (4). In vitro studies using 
short RNA constructs (5) showed that two DIS RNA 
molecules interact via a kissing-loop forming a loose 
dimer that may be further stabilized into an extended 
duplex (tight dimer). However, it has been shown that 
HIV-1 can replicate in the absence of the functional DIS 
(6), and the extended structure of the entire 5'-UTR dimer 
has not been confirmed (7). Loose dimers could be 
detected only under native electrophoretic conditions 
(TBM); they dissociate during semi-native gel electrophor- 
esis (TBE). 

Because the mechanism of dimerization and encapsida- 
tion of HIV-2 differs somewhat from that of HIV-1 
(8-10), HIV-1 RNA cannot be packaged into HIV-2 
virions (1). Two dimerization signals within the HIV-2 
leader RNA have been proposed: SL1 (similar to HIV-1 
SL1) (1 1,12) and a self-complementary sequence located in 
the PBS (10). Alternate usage of these sites (13) and in- 
volvement of TAR RNA (fran.y-activation responsive 
region) (14,15) in the dimerization have also been sug- 
gested. Similarly to HIV-1 (16), the LDI/BMH global 
riboswitch was proposed (17) in which presentation of 
the SL1 apical loop palindrome and its ability to 
dimerize are influenced by structural rearrangements of 
the entire HIV-2 leader RNA. In the case of HIV-1, 
in vivo RNA structural probing (7,18) did not confirm 
the riboswitch model. Also similar to HIV-1 (6), HIV-2 
can replicate without the SL1 apical loop palindrome. 
Certain mutants with deletions and/or substitutions in 
SL1 had no effect on viral replication, while mutations 
in the * region (encompassing the sequence between the 
PBS and SL1) were found to be strongly inhibitory 
(19,20). Within this region, a 10-nt palindromic sequence 
called PAL (+392-401) is believed to play an important 
regulatory role in HIV-2 dimerization (12,20). Specifically, 
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it has been proposed that the 3'-end of PAL pairs with a 
sequence immediately downstream of SL1. This motif, 
referred to as stem B, effectively stabilizes SL1 and 
prevents its participation in the extended interstrand 
base pairing predicted to exist in tight dimers (8,21). A 
different SL1 secondary-structure model has also been 
proposed in which only a short hairpin encompassing 
+409-436 residues is formed and the PAL sequence 
either remains unconstrained (9,12,22,23) or interacts 
with the upstream region (+377-386) to form a separate 
stem-loop motif (19,24). A model of dimerization 
regulated by alternate presentation of the PAL and SL1 
sequences produced by local RNA rearrangements in the 
monomeric RNA has also been suggested (24). 

A regulatory role in HIV dimerization has also been 
proposed for the long-range U5-AUG interaction, referr- 
ing to the base-pairing of a C-rich sequence located be- 
tween the poly(A) and PBS domains with a G-rich 
sequence around the Gag translation initiation region 
(9,12,25). The HIV-2 U5-AUG is abbreviated CGI 
[C-box/G-box interaction; (8)]. In contrast to HIV-1, for- 
mation of U5-AUG duplex impairs HIV-2 RNA tight di- 
merization, probably due to intramolecular entrapment of 
the SL1 motif (8). 

The dimerization and encapsidation signals are closely 
linked in HIV-2 and the NC protein (abbreviated NCp8), 
as a domain of Gag, is involved in both processes (19). 
The NC protein contains two zinc finger motifs. The 
HIV-1 protein (NCp7), rich in arginine and lysine, has 
been shown to be a potent chaperone for both nucleic 
acid folding and unfolding (26), and sites of interaction 
between NCp7 and HIV-1 RNA have been characterized 
(7,27,28). 

Our knowledge of interactions between NC protein and 
the HIV-2 leader RNA is based mostly on the results 
obtained using GST tagged NCp7 together with HIV-2 
leader RNA (27). Although NCp7 and NCp8 share 67% 
similarity, differences in packaging and dimerization have 
been reported for HIV-1 and HIV-2 (1,8,10). Both zinc 
finger motifs in HIV-2 NC have been shown to specifically 
interact with viral RNA during genome encapsidation 
(29), which is in marked contrast to HIV-1 (30). 
Moreover, although previous studies have demonstrated 
site-specific contacts between HIV-1 NC and structural 
elements within the leader region of HIV-1 RNA (27), 
similar interactions between NCp8 and HIV-2 leader 
motifs have yet to be identified. 

The results of in vitro structural mapping of the HIV-2 
leader RNA (+1-560 nt) and shorter transcripts repre- 
senting motifs considered important for dimerization, 
encapsidation and interactions with NCp8 protein are pre- 
sented. The proposed secondary structure of the HIV-2 
leader RNA represents a single global fold captured 
in vitro as a loose dimer. Two contact interfaces in the 
loose dimer form were identified within the SL1 and 
TAR motifs. Recombinant HIV-2 NC exhibits tight bind- 
ing to SL1 (K d = 125 ± 25 nM) and TAR (K d = 450 ± 
50 nM), but poor binding to hairpin motifs at the leader 
3'-end (*1, SD, *2 and *3). Our in vitro results allow us 
to propose a model for the initiation of HIV-2 RNA 
dimerization. 



MATERIALS AND METHODS 

RNA synthesis 

The DNA templates for in vitro transcription were 
generated by PCR amplification of the plasmid HIV-2 ^ 
large (31) containing a T7 promoter directly upstream of 
the +1 position of the wild-type HIV-2 ROD sequence. 
PCR primers used for template preparation are presented 
in Supplementary Table SI (see Supplementary Data). 
The DNA template for the UTRh3 mutant was generated 
using the megaprimer PCR method (32). All PCRs were 
performed under identical conditions using Ambion 
SuperTaq™ Plus polymerase and conditions described 
earlier (33). All the RNA transcripts were synthesized 
with the Ambion T7-MEGAshortscript following the 
manufacturer's protocol. Transcripts were purified by de- 
naturing gel electrophoresis (8 M urea), followed by 
elution and ethanol precipitation. Purified RNAs were 
dissolved in sterile water and stored at — 20° C. 
Oligoribonucleotides (TAR hairpin III, SL1 upper part, 
*1, SD, *2 and *3), Cy5-labelled DNA and 2'-0- 
MeRNA were created by chemical synthesis using 
solid-support aided phosphoramidite chemistry, and in 
the case of RNA, 2'-0-TOM protected phosphoramidites 
(Glen Research) were utilized. Fluorescent labels were 
introduced at the 3'-end using commercial Cy5-modified 
solid supports (Glen Research). After deprotection, oligo- 
mers were purified by denaturing gel electrophoresis 
(8M urea), followed by elution and desalting on NAP 
columns (Amersham Biosciences). 

RNA labelling 

RNA molecules were labelled post-transcriptionally 
with 32 P at the 5'-end or intramolecularly during 
in vitro transcription with [a- 32 P]-GTP (Ambion 
T7-MEGAshortscript). 5'-end-labelling of RNA mol- 
ecules with 32 P was performed using [y- 32 P]-ATP (ICN) 
and the Ambion KinaseMax kit according to the protocol. 
Oligonucleotides obtained by the chemical synthesis were 
5'-end-labelled using [y- 32 P]-ATP and T4 polynucleotide 
kinase (Fermentas) according to protocol. Labelled RNAs 
and DNA primers were purified by denaturing PAGE, 
followed by elution in water and ethanol precipitation. 
Samples were dissolved in sterile water and stored 
at -20°C. 

In vitro dimerization of HIV-2 RNAs 

The homogeneity of all RNA molecules was checked by 
denaturing PAGE. Aliquots of 32 P-labelled RNAs 
(5-10 pmol) in sterile water were denatured at 90°C for 
lmin. A buffer containing lOmM Tris-HCl (pH 7.5), 
130mM KC1, 0.5 mM EDTA and the appropriate 
MgCl2 concentration (0-5 mM) was added followed by 
slow cooling to room temperature. Loose dimer formation 
was also tested under SHAPE folding and buffers condi- 
tions (see below). For tight dimerization, RNA was 
incubated at 65°C for lOmin and then at 37 or 55°C. 
For the loose dimer formation, the 65° C incubation was 
omitted and RNA was incubated at 37°C. The dimeriza- 
tion time was established individually for each transcript 
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and is indicated on the figures. To stabilize the RNA con- 
formers, the samples were kept on ice for 1 min, then 2 ul 
30% glycerol or 25% Ficoll were added and samples were 
loaded directly onto the gel. The process of RNA dimer- 
ization was studied using gel electrophoresis under native 
(TBM, 4°C) or semi-native (TBE, 4°C) conditions. As a 
control, RNA denatured in formamide (5min/100°C) was 
also loaded onto the gel. 

HIV-2 RNA structural studies using antisense 
oligonucleotides 

RNA molecules were labelled with 32 P, and oligonucleo- 
tides with Cy5 or unlabelled. RNAs (5-10 pmol) were 
folded as described for the dimerization assays and incu- 
bated at 37°C for 5 to 10 min. The dimerization buffer with 
5mM MgCl 2 was used. Subsequently, the appropriate 
oligonucleotide (DNA or 2'-(9-MeRNA) was added in a 
10- and 20-fold molar excess and the samples were 
incubated at 37°C. The loose dimerization was allowed 
to proceed for an additional 15 min. For analysis of 
tight dimers the incubation time varied according to tran- 
script length as follows: 2h for +1-560 RNA and 10 min 
for shorter transcripts encompassing the leader 3'-end. 
The samples were kept on ice for 1 min, 2 ul of 30% 
glycerol or 25% Ficoll were added and the samples were 
loaded directly onto a TBM or TBE gel (4°C). 

In silico two-dimensional RNA structure prediction 

Two-dimensional (2D) RNA structures were predicted 
using RNAstructure PC v. 4.3 or 4.6 with default 
settings applied (34). We considered all the secondary 
structures predicted within 20% of free energy of the 
most stable variant, subsequently changing the window 
size parameter to generate closely related structures. For 
the SHAPE-constrained folding, default parameters of 
RNAstructure PC v. 4.6 were used. 

Enzymatic structural probing of RNA 

Limited digestions with nucleases SI (Fermentas), RNase 
Tl (Boehringer), RNase VI (Ambion) and RNase A of the 
5'-end-labelled target RNAs (30 000 c. p.m. per sample) 
were performed in a buffer containing 10 mM Tris-HCl 
(pH 7.2), 40 mM NaCl, 8mM carrier RNA (Ambion) and 
5 mM MgCl 2 . Prior to the reactions, RNAs were refolded 
by heating at 90° C for 1 min in reaction buffer and slow 
cooling to 37°C. The renatured RNAs were incubated for 
10 min at 37°C with increasing concentrations of the re- 
spective enzyme: Tl (0.2-0.4 U), VI (0.01-0.0025 U), A 
(0.1-0.01 ng) or SI (0.5-1 U). For limited digestion with 
RNase III (Ambion), the reaction was carried out at 37°C 
for 5 min with 0.5 or 1U of enzyme. Reactions were 
terminated by addition of an equal volume of stop 
buffer (8M urea, 20 mM EDTA, 0.02% xylene cyanol 
and 0.02% bromophenol blue) and freezing on dry ice. 

Mapping oligodeoxynucleotide hybridization within the 
HIV-2 leader RNA 

Prior to the reaction, RNAs (30 000 c.p.m.) were subjected 
to refolding as described above. Subsequently, 2 pmols of 



appropriate oligodeoxynucleotide were added and the 
mixture was incubated for 10 min at 37°C. The hybridiza- 
tion sites were detected using 0.25-0.5 U of RNase H at 
37°C for 5 min. Reactions were quenched by the addition 
of an equal volume of stop buffer and freezing on dry ice. 

Chemical structural probing of RNA 

Probing with DEPC. For chemical probing with diethyl- 
pyrocarbonate (DEPC), the target 5'-end-labelled RNAs 
(50 000 c.p.m) were refolded as described above. Reactions 
(10 ul) with 0.5 ul DEPC were allowed to proceed for 5, 15 
or 30 min at 37°C. Samples were ethanol precipitated twice 
in the presence of 5 ug carrier RNA (Ambion) and treated 
with 10 ul of 1 M aniline-acetate buffer (pH 4.5) according 
to standard protocol (35). Before electrophoresis, the 
RNAs were dissolved in 5 ul of sterile water and an 
equal volume of formamide with tracking dyes. 

Ph 2+ ion-induced cleavage. Prior to Pb 2+ ion-induced 
cleavage, RNA transcripts (30 000 c.p.m.) were subjected 
to refolding as described above with a reaction buffer con- 
taining 10 mM Tris-HCl (pH 8.0), 40 mM NaCl, 8mM 
carrier RNA and 5mM MgCl 2 . Afterwards, freshly 
prepared Pb(OAc) 2 solution was added to a final concen- 
tration of 0.25, 0.5, 1 and 2mM and the reaction mixtures 
were incubated for 15 min at room temperature (36,37). 
Reactions were quenched by the addition of an equal 
volume of stop buffer and freezing on dry ice. 

All products of nuclease digestion and chemical 
cleavage were analysed by denaturing PAGE in lx 
TBE. In order to assign RNA cleavage sites, reaction 
products were subjected to electrophoresis along with 
the products of alkaline RNA hydrolysis (formamide 
with 3mM MgCl 2 at 100°C for 20 min) and limited Tl 
nuclease digestion [0.2 U, 50 mM sodium citrate (pH 5.3) 
and 7M urea at 55°C for 10 min] of the same RNA. 

NMIA treatment of RNAs. RNAs (20 pmols) were heated 
at 90° C for lmin in 20 ul of renaturation buffer [10 mM 
Tris-HCl (pH 7.5), lOOmM KC1, 0.1 mM EDTA] and 
slowly cooled to 4°C. Subsequently, 97 ul of water and 
29 ul of 5x folding buffer [200 mM Tris-HCl (pH 7.5), 
650 mM KC1, 2.5 mM EDTA, 25 mM MgCl 2 and 40U 
RNase inhibitor] were added and the RNA was incubated 
at 37°C for 10 min. For NMIA modification reactions 
taking place in the presence of oligonucleotides, it was 
at this point that the oligonucleotide was added, 
followed by the incubation for 10 min at 37°C. The 
mixture was divided into equal parts, treated with 7.3 ul 
of 180mM NMIA in anhydrous DMSO (+) or DMSO 
alone (— ) and allowed to proceed at 37°C for 50 min. 
The RNA was precipitated and resuspended in 10 ul of 
water. Modified sites were detected by primer extension, 
followed by electrophoretic fragment separation. 

The primer extension reactions were performed as 
described earlier (38) except that 5 mM DTT and 1 M 
betaine were added. Dideoxy sequencing markers were 
generated using unmodified RNA. Several 5'-end-labelled 
DNA primers were used to analyse the entire HIV-2 leader 
RNA or shorter transcripts derived from it. The sequences 
of these primers are provided in Supplementary Table S2. 
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cDNA band intensities for the (+) and (— ) reactions were 
integrated using SAFA (39) and corrected for stochastic 
drop-off (40). The control reaction (— ) was subtracted 
from NMIA reaction (+) and SHAPE reactivities were 
normalized to a scale in which 0 indicates an unreactive 
residue and the average intensity at highly reactive 
residues is set to 1.0, as described previously (7). 

Cloning of recombinant HIV-2 NC protein 

A synthetic gene was designed based on the sequence of 
the HIV-2 ROD NC protein (NCp8) taken from http:// 
ncbi.nml.nih.gov. The gene sequence was codon optimized 
and engineered to contain the EcoRI restriction site at its 
5'-end and the BamHI restriction site at its 3'-end. This 
gene was synthesized based on a PCR method utilizing 
single-stranded overlapping oligonucleotides. The PCR 
product was purified by phenol extraction and ethanol 
precipitation. After EcoRI and BamHI digestion (37°C/ 

1 h), phenol extraction and ethanol precipitation, this 
insert was cloned into the pGEX-4T-3 expression vector 
as a GST fusion gene. The identity of the clone was con- 
firmed by nucleotide sequencing. 

HIV-2 NC protein expression and purification 

The GST-NCp8 protein was expressed in Escherichia coli 
BL21— Codon Plus (DE3) — RIL cells (Stratagene) and 
purified by affinity chromatography on Glutathione- 
Sepharose (Amersham Pharmacia Biotech.) using a 
batch purification method. E. coli cells were grown in 
the LB medium at 37°C to an A600 of 0.5, after which 
expression of GST-NCp8 was induced with 1 mM IPTG. 
The cells were harvested 4 h after induction by centrifuga- 
tion (4000 rpm, 15min, 4°C) and resuspended in 50 ml 
lysis buffer [50 mM Tris-HCl (pH 7.9), lOOmM KC1, 
1% Triton X-100, 50mM DTT, 0,1 mM PMSF, 2mg/ml 
benzamidine]. The cells were lysed by three repeated freeze 
and thaw cycles (— 80°C/0°C), followed by sonication. The 
cell debris was pelleted by centrifugation (15 000 rpm, 
40min, 4°C), the resulting supernatant was mixed with 

2 ml Glutathione-Sepharose and incubated with gentle 
agitation at 4°C for 2h. The suspension was centrifuged 
(1500 rpm, 2min, 4°C) and the beads were washed twice 
with 10 ml buffer I [PBS (pH 7.4), 1% Triton-X 100, 
50 mM DTT, protease inhibitors (Roche)], 10 ml buffer 
II [50 mM Tris-HCl (pH 7.5), 300 mM NaCl, 50 mM 
DTT, protease inhibitors], 10 ml buffer III [50 mM Tris- 
HCl (pH 7.5), lOOmM KC1, 10% glicerol, 50mM DTT] 
and 10ml cleavage buffer [50 mM Tris-HCl (pH 7.5), 
20 mM KC1, 10 mM DTT]. Cleavage of the GST-NCp8 
fusion protein with thrombin (10 U) occurred overnight at 
4°C (mixture volume up to 0.5 ml), and the protein was 
stored at 4°C in cleavage buffer. The yield was ~5mg of 
purified protein per liter of culture media. 

Protein expression and purification were monitored by 
SDS-PAGE and Coomassie Brilliant Blue staining. 
Molecular mass determination was performed using an 
Autoflex MALDI TOF mass spectrometer (Bruker 
Daltonics). 



NCp8/HIV-2 RNA gel retardation assay 

Prior to the binding reaction, RNA samples labelled with 
32 P were refolded. RNA-protein complex formation 
(volume 10 ul) was carried out in a buffer containing 
50 mM Tris-HCl (pH 8.0), 20 mM KC1, 100 mM DTT, 
0.1% Triton X-100 for lOmin on ice. The binding was 
tested at different ZnCl 2 (0-100 uM) concentrations. In 
order to test the NCp8 binding specificity reactions were 
carried out in the presence of 100-fold molar excess 
of carrier RNA (Ambion). The complexes were mixed 
with 2 Lil of 25% Ficoll with dyes and loaded directly 
onto a non-denaturing polyacrylamide gel (acrylamide/ 
bisacrylamide, 75:1) containing 0.5 x TB with 0.1% 
Triton X-100. Electrophoresis was carried out at a low, 
strictly controlled gel temperature (4°C). The equilibrium 
constants (K d = [NC][RNA]/[NC-RNA]) were estimated 
using 1 pmol RNA and dilutions of NCp8 [from 50 nM 
to 2 uM for TAR, poly(A) and SL1; 500 nM to 50 uM for 
*1, SD, *2 and *3]. 

Mapping the RNA binding sites for NCp8 

The NCp8 binding sites within the RNAs were detected 
using Pb 2+ ion-induced cleavage. NCp8/HIV-2 RNAs 
complexes were formed as described above, followed by 
addition of carrier RNA (0.5 ug, Ambion) and Pb(OAc) 2 
solution to final concentration 2mM and by incubation 
for lOmin at 0°C. The reactions were quenched by the 
addition of 40 ul of 2 mM EDTA and ethanol precipita- 
tion. The reaction products were analysed by denaturing 
PAGE in 1 x TBE. 

Results from all PAGE experiments with radiolabeled 
or fluorescently labelled RNAs and DNAs presented in 
this study were visualized and quantitatively analysed 
using a Typhoon 8600 Imager with ImageQuant 
software (Molecular Dynamics) or an FLA-5100 with 
MultiGaugeV 3.0 (FujiFilm). 

In all cases, at least three independent experiments were 
performed, and the data presented are representative of 
the whole. 



RESULTS 

HIV-2 leader RNA loose dimer structure 

The secondary-structure model of the HIV-2 leader RNA 
is based on the enzymatic and chemical probing data 
related to the monomer state (17,23). Although several 
studies attempted to address rearrangements occurring 
within the leader RNA upon dimerization (17,22,24), the 
structure of the HIV-2 leader RNA dimer is still unknown. 
To reveal the HIV-2 leader RNA loose dimer structure, we 
applied the SHAPE method (41) which had been success- 
fully used to identify kissing-loop interactions (40,42). 

The length of the HIV-2 leader RNA, encompassing 
+ 1-546 nt, was extended to 560 nt to include a fragment 
of the Gag coding sequence proposed to be involved in the 
U5-AUG duplex formation (Figure 1A) (8,9). Analysis of 
the 560-mer leader transcript using TBM gels and differ- 
ent folding conditions (Figures IB, C and 2A) indicates 
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that loose dimers were formed very efficiently at 37°C in 
the presence of Mg 2+ . After 10 min, over 50% of the RNA 
was in the dimeric form (Figure 2A). The comparison of 
TBM and TBE gel patterns shows that tight dimers, under 
identical folding conditions, were formed very inefficiently 
even after prolonged incubation (compare Figure 1C, 
fourth lane and ID). Consequently, under SHAPE 
folding and NMIA modification conditions (38), most of 
the HIV-2 leader RNA molecules were in the loose dimer 
form (Figure 2A). 

Well-resolved SHAPE data, supported by native gel 
patterns, point to the one global fold of the HIV-2 
leader RNA (Figures 1C and 2 and Supplementary 
Figure SI) captured in vitro in its stable loose dimer 
state. In this dimeric structure, the TAR domain exists 
in its branched form (43) (Figure 2B). This domain 
(+1-123), in its isolated state for HIV-2 (33) and in vivo 
for SIV (44), undergoes an interconversion between a 
branched hairpin form and an extended form. The equi- 
librium state in vitro is highly susceptible to the folding 
conditions. Analysis of the isolated TAR migration on 
native gels indicates that under SHAPE folding condi- 
tions, the branched hairpin form prevails (Supplementary 
Figure S2). 



SHAPE indicates that the poly(A) domain (+125-184), 
presented earlier as a hairpin with a large 29-nt apical loop 
(45), is structured. Residues A144 to U149, CI 58 and 
G160 to CI 62 were not modified by NMIA (Figures 2B, 
C and 3D), suggesting that they are constrained. 
Interestingly, three structural models were generated in 
silico for the poly(A) domain, using RNAstructure 
software (34), but none of the predicted structures fit the 
experimental data. To address whether those residues are 
involved in long-range interactions with other domains of 
the leader RNA, we performed extensive enzymatic 
and chemical structure probing of the isolated poly(A) 
hairpin encompassing +125-184 residues (Figure 3 A and 
C). RNase A, specific for single-stranded pyrimidines, 
cleaved only 4 of 13 pyrimidines located in the apical 
loop and none of the residues resistant to NMIA modifi- 
cation were recognized. In addition, residues C145-U149 
and C158-C162 were cleaved by RNase VI and not by 
nuclease SI. The use of RNase III led to a strong cleavages 
at U138 and U172 and two weaker hits at C162 and U134, 
supporting the existence of a structured section within the 
loop of the poly(A) hairpin. These and other findings are 
summarized in Figure 3C. Our analyses exclude a possi- 
bility that these unexpected structural features of the 
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Figure 2. (A) Analysis of the loose dimer formation by the HIV-2 leader RNA (+1-560) in the SHAPE buffers and folding conditions (TBM agarose 
gel). (B) Secondary-structure model of the HIV-2 leader RNA captured in the loose dimer form. Two kissing-loop interfaces are indicated as black 
frames. Nucleotide residues accessibilities to NMIA are given in colours and according to scale. (C) Processed SHAPE reactivities as a function of 
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reactivity distributions for the HIV-2 leader RNA. Box outline middle 50% of dataset; the median is shown with heavy line. Circles indicate extreme 
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poly(A) domain result from interactions with other elem- 
ents of the leader RNA. In addition, our data rule out the 
formation of a pseudoknot between nucleotide residues 
within the apical loop of poly(A) and residues located in 
the coding region, a structural feature which has been 
recently proposed for HIV-1 (7). 

The data obtained point to the extended SL1 forming 
stem B (8,1 1,21), rather than the short hairpin (+409^36) 
(9,12,22,23). Residues of the 3'-end of PAL 
(G397CUCC401) were not modified by NMIA and they 
form the stem B of SL1 (Figures 2B, C and 3D). The 
in vitro data obtained for the captured loose dimer 
exclude the presence of in silico predicted stem-loop struc- 
tures encompassing the *I* region containing the PAL 
sequence (19,24). Based on enzymatic and chemical 
probing data obtained for the shorter transcripts con- 
taining SL1, we propose the existence of several non- 
canonical base pairs closing bulges and loops within 
the SL1 structure (Figure 3B, C and Supplementary 
Figure S3). 

Analysis of the data obtained for the HIV-2 leader 
RNA 3'-end (+380-560) (Figure 2B and C and 
Supplementary Figure SI) confirms the presence of char- 
acteristic short hairpins Vfl, SD, 4*2 and ^3 presented in a 
previous model (23). In addition, the U5-AUG (CGI) 
interaction (8,9) is confirmed. The lack of NMIA modifi- 
cation indicates that residues G543 to C553 (G-box) 
are paired with G186 to C196 (C-box). As presented in 
Figure 2B and C and Supplementary Figure SI, the 
analysis of the SHAPE data also supports base-pairing 
between segments C3 85 CACGAC3 91 and G 535 AUUGUG 
G 542 . 

The secondary-structure model of the HIV-2 leader 
RNA proposed here is based on the modification 
pattern where single-stranded regions are reactive 
towards NMIA (Figure 2B and C). However, two excep- 
tions were found that are consistent with the presence of 
tertiary interactions involved in the loose dimer forma- 
tion. Both of them represent a kissing-loop interaction. 
Specifically, the SL1 palindromic sequence G 42 oGUACC 
425 and a self-complementary sequence within the TAR 
(UiooUGGCCGG 10 7) were largely unreactive towards 
NMIA. The latter of those sites is located within the 
short 3' hairpin of the TAR domain, and is characteristic 
of both TAR forms (abbreviated herein as hairpin III). 
Within the aforementioned sequence, only U101 which 
is involved in an intermolecular G-U pair formation, 
was found to be susceptible to NMIA modification 
(Figure 2C and Supplementary Figure SI). Similar to 
the data obtained with SL1 (Figure 3B and C), the hairpin 
III loop in the monomeric state was fully accessible to 
several enzymes and chemical reagents used to probe the 
isolated motif (+93-1 13) and a longer RNA encompassing 
TAR (+1-123) (33). In addition, both short constructs 
representing the upper part of the SL1 and the isolated 
hairpin III of TAR were found to undergo the transition 
from loose to tight dimers very efficiently (Supplementary 
Figure S4). To further confirm the participation of the 
TAR hairpin III apical loop in formation of the 
kissing-loop complex, an RNA mutant was constructed. 
The UioiGGCC 10 5 sequence of the TAR hairpin III apical 



loop was changed to A101AAAA105 in the UTRh3 mutant 
of the HIV-2 leader RNA. The NMIA modification 
pattern obtained for the UTRh3 mutant transcript 
(Figure IE) shows that, contrary to the wild-type leader 
RNA, residues of the hairpin III apical loop were reactive 
towards NMIA. SHAPE data also showed that the sec- 
ondary structure of the TAR domain in the mutant RNA 
remained unaffected. The analysis of the dimerization 
properties of UTRh3 RNA using TBM gels, indicates 
that loose dimers were formed less efficiently relative to 
the wild type HIV-2 leader RNA (Figure IB). 
Furthermore, the loose dimer formation was assayed in 
the presence of an oligodeoxynucleotide complementary 
to the apical loop and upper stem of SL1 (probe 1) 
(Figure IB). The oligodeoxynucleotide was added to the 
pre-folded RNA. Although the strong inhibitory effect of 
probe 1 was observed for both wild type and mutant RNA, 
it should be emphasized that the presence of probe 1 
almost completely abolished UTRh3 loose dimerization. 
In this case, one contact site is disrupted by the mutations, 
and the second by hybridization with probe 1. To extend 
our findings on the interaction interfaces in the loose 
dimer, we also tested the ability of the oligodeoxy- 
nucleotide complementary to TAR hairpin III (probe2) 
and SL1 (probe 1) to suppress loose dimer formation of 
the wild-type leader RNA (Figure 1C). While some reduc- 
tion of loose dimerization was observed in the presence of 
probe2, the effect was not as pronounced as for probe 1. 
However, when both probes were included in the hybrid- 
ization mixture, the inhibitory effects on dimer formation 
were additive (Figure 1C). 

PAL/SL1 rearrangement 

It has been proposed that the PAL sequence plays a regu- 
latory role in SLl-mediated HIV-2 RNA dimerization and 
monomeric RNA can adopt two alternative conform- 
ations (24). The first conformation, with an extended 
SL1 and PAL partially base-paired in the stem B, 
impairs dimerization. The second, with a short SL1 
(+409—436) and PAL involved in formation of an add- 
itional stem-loop motif (19), favours dimerization (24). 
Because our studies of the leader RNA (+1-560) in the 
conditions applied do not show indications of PAL/SL1 
rearrangement, we decided to study this region within dif- 
ferent structural contexts. A structural interconversion 
that involves only a limited number of base pairs within 
a relatively small region might be energetically more fa- 
vourable in a shorter transcript. Transcripts +397-445, 
+397-463, +377^163, +377-560 (Figure 3B and C and 
Supplementary Figure S3) were subjected to structural 
probing using selected single-strand specific (SI, Tl, A) 
and double-strand specific (RNase VI) enzymes and 
chemical reagents (DEPC and Pb 2+ ions). The data 
obtained show that in our in vitro conditions, the second- 
ary structure of the extended SL1 remains unchanged re- 
gardless of the presence of adjacent elements at the 5' and/ 
or 3'-ends. 

To allow direct comparison to the SHAPE data 
obtained for the full-length leader RNA, two transcripts, 
+377-560 and +397-560 were probed with NMIA 
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(Supplementary Figure S5). Only the +377-560 RNA 
contains the stretch of nucleotide residues that would 
allow the proposed PAL/SL1 structural rearrangements 
(24). NMIA reactivity profiles indicate that the structures 
of transcripts +377-560 and +397-560 are the same, and 
match that of the 3'-end of the entire HIV-2 leader RNA 
in vitro. 

The structure of the HIV-2 leader RNA 3'-end (+380- 
560) was also analysed in the context of the +1-891 tran- 
script. Although this transcript encompasses a much 
larger fragment of the Gag coding sequence, no differ- 
ences in the organization of leader RNA were found 
(Supplementary Figure SI). 

The features of the extended SL1 structure characteris- 
tic of the isolated hairpin are retained within all RNAs 
analysed, regardless of the additional 5' and/or 3' frag- 
ments of leader RNA, indicating that adjacent structural 
motifs fold independently. There is no evidence in these 
data to suggest that multiple thermodynamically stable 
forms of PAL/SL1 region exist in the monomeric RNA. 

The SL1 palindrome is dispensable for the stability of the 
tight dimer 

The HIV-2 leader RNA forms tight dimers in vitro very 
inefficiently (11,12). However, as reported earlier, forma- 
tion of tight dimers was induced by folding the target 
RNA in the presence of antisense oligonucleotides 
directed against such regions as U5-AUG duplex, PAL 
or SL1 (8,12). To investigate the structural properties of 
the SL1 within the pre-folded leader RNA and to resolve 
the influence of oligonucleotide probes on tight dimer for- 
mation, a set of hybridization experiments was performed. 
Initially we used an oligodeoxynucleotide (probe 1) com- 
plementary to the 5' upper stem and apical loop of SL1 
(+407^426) of the same sequence as reported earlier for 
the probe asSLl (8,12). Probel, labelled at the 3'-end with 
the fluorescent marker Cy5, readily associated with the 
pre-folded RNA present in a 1:1 ratio of monomer to 
loose dimer. Inclusion of the probel in the mixture 
led to an increase in leader RNA tight dimerization 
(Figure 4A). The amount of dimer formed in the presence 
of probel after 2h at 37°C is comparable to that obtained 
without probel after 8h at 55°C. TBE gel patterns show 
that the fluorescent probel co-migrates with both the 
monomer and the tight dimer. An RNase H assay con- 
firmed that probel was bound at the expected location 
within the SL1 domain. Residues G411 to C425 were 
cleaved within the 377-560 RNA, indicating strand 
invasion (Supplementary Figure S6). 

To reveal the mode of action for probel, two shorter 
2'-0-Me oligoribonucleotides 3'-end-labelled with Cy5 
were prepared: probelA complementary to the apical loop 
(+416^126) and probe IB complementary to the 5' upper 
stem of SL1 (+407^17). Each of them was hybridized to 
transcripts representing the 3'-end of the leader RNA but 
differing with respect to the presence of the * region en- 
compassing the PAL palindrome (RNAs +377-560 and 
+397-560). Transcript +377-560 contains the entire PAL 
sequence, whereas +397-560 includes only the 3' part of 
PAL (i.e. that involved in SL1 stem B formation). Probel 



and probel A significantly increased the level of tight 
dimers for the +377-560 transcript, but not for the 
+397-560 RNA (Figure 4B), indicating that the * 
region plays a role in tight dimerization, as previously 
proposed (20). In contrast, probe IB triggered an 
increase in tight dimer formation for both transcripts, 
indicating that tight dimerization induced by this probe 
is ^ independent. However, in the case of the full-length 
leader RNA (+1-560), the influence of probes 1A and IB 
on tight dimerization was not as pronounced as in the case 
of probel (data not shown). 

SHAPE was applied to reveal probel -induced changes 
within the full-length leader RNA structure. Probel was 
added to the pre-folded leader RNA (monomer to loose 
dimer ratio 1:1) and the resultant complexes were treated 
with NMIA. Although the SHAPE readout represents the 
superimposed reactivities of both free and bound leader 
forms, some structural features are apparent (Figure 4C). 
Interesting structural rearrangements are observed in the 
SL1 stem B. Nucleotide residues G437-C443 showed an 
increase of their reactivity towards NMIA in relation to 
the leader alone. Residues G397-C401 remained unreact- 
ive. These two stretches of residues in the monomeric 
leader or its loose dimer fold into the SL1 lower stem 
(stem B). In addition, the 5'-end of the PAL was protected 
from NMIA modification and residues immediately 
upstream gained reactivity. No additional changes in the 
reactivity pattern of the leader RNA in the presence 
of probel were observed (Figure 4C, Supplementary 
Figure S7). This indicates that the rearrangements of the 
secondary structure induced by probel are restricted to 
the PAL/SL1 region. In addition, it shows that the SL1 
apical loop is not involved in base pairing with other 
regions of HIV-2 leader RNA. 

The SL1 motif is a high-affinity binding site for NCp8 

An expression vector was constructed that generates the 
HIV-2 ROD NCp8. The gene was cloned into the 
pGEX-4T-3 vector as a GST fusion. The clone over- 
expresses in E. coli efficiently and yields ~5 mg of pure 
NCp8 protein after GST removal with thrombin 
(Supplementary Figure S8A). The resultant protein con- 
tains two additional residues at the N-terminus due to the 
protease specificity. MALDI-TOF showed the expected 
NCp8 molecular weight (Supplementary Figure S8B). 

The specificity of NCp8 interaction with 
32 P-end-labelled RNA transcripts corresponding to 
isolated domains of the 5'-UTR has been analysed using 
an electrophoretic mobility shift assay (EMSA). Titration 
of the transcripts encompassing isolated domains: TAR, 
poly(A), SL1, *1, SD, *2 and *3 with NCp8 resulted in 
the formation of a new, distinct bands, albeit with differ- 
ing intensities (Figure 5 A and Supplementary Figure S9). 
NCp8 has the highest affinity for SL1 with a dissociation 
constant of 125 ± 25 nM. At least three discrete bands 
were formed, in accord with multiple proteins interacting 
with RNA. Complexes remained unaffected in the 
presence of 100-fold molar excess of an unrelated competi- 
tor RNA, confirming the specificity of the interaction 
(Supplementary Figure S9). The NCp8/SLl dissociation 
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constant is similar to that reported for HIV-1 NCp7 and 
SL3 RNA (28). 

Footprinting analysis of the NCp8/SLl complex using 
Pb 2+ induced RNA cleavage was performed under strin- 
gent conditions that allow most of the base-paired nucleo- 
tides to be cleaved. Titration of the +397^+63 transcript 
with NCp8 resulted in a decrease in RNA cleavages at 
residues C410-C417 and A432-U438 at low-protein con- 
centration (Figure 5B). This indicates that the SL1 upper 
stem is the NCp8 binding site. Our structure probing data 
obtained for isolated SL1 in the absence of protein show 
that this binding site contains a purine-purine base pair 
(G413-A432) next to the unpaired G414. The G414 
residue was susceptible to Pb 2+ induced hydrolysis. The 
lack of reactivity of the A432 residue towards DEPC 
suggests that its Hoogsteen site is involved in base-pairing 
with G413. 

Footprinting analysis also demonstrates that cleavages 
at C425 and C431 were enhanced at low protein 



concentration, while in the presence of high concentra- 
tions of NCp8 most of the residues previously protected 
became reactive. The data are consistent with the 
destabilizing activity of NC proteins on the RNA struc- 
ture (46,47). 

The affinity of NCp8 for the other isolated hairpin 
motifs present at the 3'-end of HIV-2 leader RNA is 
much lower than that for SL1. K d s for ^l and SD were 
~4-5uM, for *2 5-6 uM and >10uM for *3, suggesting 
that the observed associations are likely the product of 
nonspecific binding. Interestingly, a much higher affinity 
was observed for TAR and poly(A) isolated domains 
(K d = 450 ± 50 and 550 ± 50 nM, respectively). 

DISCUSSION 

The HIV-2 leader RNA structure in the loose dimer form 

It has been recently demonstrated that in vitro transcribed 
HIV-1 leader RNA folds into the same structure as RNA 
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present in virus particles, and that this structure represents 
single fold (7). It was suggested that the HIV-2 leader 
RNA monomer may adopt an alternative conformations 
with different abilities to dimerize (17,24). Our results of 
enzymatic and chemical structural probing, supported by 
native gel electrophoresis, show that in the presence of 
magnesium, the HIV-2 leader RNA strand shows prefer- 
ence to exist in vitro as a single conformer. The data col- 
lected from the SHAPE probing correspond to the leader 
RNA loose dimer structure. This is due to the preferential 
formation of the loose dimer by the HIV-2 leader RNA 
and to the mild action of the NMIA reagent, which is 
non-disruptive to the loose dimer (38,41,42). 

To our knowledge, this is the first study to structurally 
characterize the HIV-2 leader RNA captured in one of the 
potential kissing-loop complex forms. Here, we document 
the existence of two contact interfaces involving palin- 
dromes: the SL1 apical loop and the TAR hairpin III 
apical loop. Although the involvement of the SL1 in di- 
merization has been studied extensively, the role of the 
SL1 apical loop has not been fully established yet. Some 
reports underlined the importance of the SL1 apical loop 
(11,12,17,24) while others disputed its involvement in di- 
merization (10,20). Our data indicate that in vitro the SL1 
apical loop palindrome contributes to the loose dimer for- 
mation. Participation of the TAR domain in HIV-2 RNA 
dimerization has also been suggested (14,15), but has not 
been proven, as is the case for HIV-1 (15,48). However, it 
was shown that deletion of the TAR caused a significant 
drop in the efficiency of HIV-2 leader RNA loose and 
tight dimer formation in vitro (17). Based on our results, 



we are able to support the involvement of TAR in dimer- 
ization and to propose that TAR hairpin III serves as a 
kissing-loop site. As we presented previously, a phylogen- 
etic comparison of TAR sequences of HIV-2 and SIV 
isolates revealed that this palindromic sequence is highly 
conserved (33). The only notable change (present in three 
different isolates) is the G106A substitution that would 
not interfere with the palindromic character of this 
sequence. Mutations in the TAR hairpin III palindrome 
markedly reduced the amount of the loose dimer formed 
in vitro (Figure IB). Furthermore, the residues of hairpin 
III apical loop were reactive towards NMIA in the 
UTRh3 mutant (Figure IE), contrary to the wild type 
HIV-2 leader RNA. The oligodeoxynucleotides comple- 
mentary to the SL1 (probe 1) or TAR hairpin III 
(probe2) suppressed the formation of loose dimers by 
the wild type HIV-2 leader RNA (Figure 1C) supporting 
the involvement of both motifs (TAR and SL1) in in vitro 
dimerization. Less pronounced effect of probe2 may 
reflect a supplementary role of the TAR hairpin III in 
the dimerization or less efficient hybridization of this 
probe. In addition, the abolished formation of loose 
dimers by the UTRh3 mutant in the presence of probe 1 
is an expected result in the light of our findings concerning 
two main contact interfaces within the HIV-2 loose dimer. 
Although our data indicate two contact sites in the HIV-2 
RNA loose dimer, our focus was not on the Nar I 
sequence and its participation in dimerization (10). It is 
known that local RNA rearrangements occur upon 
tRNA Lys3 binding (49). Therefore, structural analysis of 
that region is not emphasized here. However, we noted 
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that three of six residues (G304, G305 and C308) from the 
Nar I palindromic sequence were highly reactive towards 
NMIA (Figure 2C). 

Within the loose dimer, the global RNA fold merges 
structural features of the previously proposed models of 
the HIV-2 leader monomers (9,12,17,23). We found that 
the structures of the individual domains within the 
full-length HIV-2 leader RNA are similar of those of the 
isolated ones. Results of detailed enzymatic and chemical 
probing of the isolated domains are fully consistent with 
the SHAPE data for the +1-560 and +1-891 transcripts 
(Figures 2B, C and 3; Supplementary Figures SI and S3). 
Formation of the extended structure of SL1 is inde- 
pendent of the structural context in which it is present, 
indicating the stability of the SL1 stem B. The extended 
SL1 had been previously shown to be necessary for HIV-2 
replication and RNA encapsidation (21). We showed 
that the extended SL1 and the intramolecular U5-AUG 
are characteristic of the HIV-2 leader RNA loose dimer 
structure (Figure 2B). A previous report, based on 
the use of antisense oligonucleotide probes concluded 
that the presence of U5-AUG duplex favoured the 
extended SL1 motif formation (8,12). Although we 
do not dispute the U5-AUG regulatory role in HIV-2 
RNA dimerization, our structural probing data using 
transcripts of different lengths clearly show that the 
extended SL1 is formed irrespective of U5-AUG duplex 
in vitro. 

The SL1 RNA domain is a high-affinity binding site for 
NCp8 

The NC proteins cover the entire retroviral leader RNA 
with high occupancy. To address their binding specificity 
and the structure of RNA/protein complexes, it is useful 
to perform studies on models representing isolated RNA 
domains (28,50). All individual HIV-2 leader RNA 
domains except the PBS were incubated with recombinant 
NCp8 protein (Figure 5A). Although we cannot exclude 
the possibility that the binding affinities of full length 
Gag protein for the various constructs may differ from 
those presented here for NCp8, the fact that SL1 RNA 
is bound with an affinity more than an order of magni- 
tude higher than for SD, ^2 or *3 is strongly sug- 
gestive of specific recognition of this structural element by 
HIV-2 NC. 

Footprinting experiments reveal that the upper stem of 
SL1 is the binding site for NCp8 (Figure 5B). An interest- 
ing feature of this binding site is the presence of a purine- 
purine base pair (G413-A432) next to the unpaired G414. 
It has been demonstrated that the HIV-1 NC protein 
prefers binding to unpaired G residues (28,51). 
Moreover, our footprinting data demonstrate that NCp8 
binding destabilizes SL1, in accord with the accepted 
opinion that NC proteins destabilize continuous 
double-stranded RNA structures (46,47,52). 

The HIV-1 NC protein NCp7 binds the HIV-1 DIS 
(SL1) and ^ (SL3) domains with similar affinity, with 
^ d 's ranging from 100 to 200 nM (28). Interestingly, the 
NCp8 dissociation constant for SL1 is much lower than 
for other hairpins of the leader 3'-end (Figure 5A). It has 



been shown (27) that the heterologous NCp7 protein pref- 
erentially binds the ^3 domain within HIV-2 leader RNA 
which structurally resembles the HIV-1 hairpin * (SL3). 
The complex between the 4*3 hairpin and NCp8 was 
studied by NMR (50). We observed that *3 cannot be 
considered a specific binding site for NCp8 in vitro due 
to the high measured (above 10 uM). Furthermore, the 
*3 motif is not essential for HIV-2 dimerization or 
packaging (19). One can speculate that tight and prefer- 
ential binding of SL1 by NCp8 supports the hypothesis 
that the failure of HIV-2 to package the HIV-1 RNA may 
in part be due to the differing specificities of the two NC 
proteins. 

Interestingly, NCp8 showed a much higher affinity for 
the HIV-2 TAR and poly(A) isolated domains than for the 
hairpin motifs located downstream of SL1. The relatively 
tight binding of NCp8 to the TAR domain is especially 
interesting in view of the data presented here indicating 
the involvement of the TAR hairpin III in the formation 
of loose dimers. In addition, in clinical isolate from HIV-2 
infected patients, a gag mRNA species exists that differs 
from the genomic RNA due to the absence of the +61-202 
intron in the leader region (53). An important question 
in this case is how unspliced viral RNAs are differentiated 
by Gag protein from those of the spliced species. The 
intron contains half of the TAR structure and the 
poly(A) hairpin. Our results with respect to NCp8 
binding to the TAR and poly(A) domains may indicate 
that these two important binding sites are missing in the 
spliced RNA resulting in differentiation between both 
RNA species. 

Structural changes during the initiation of HIV-2 leader 
RNA dimerization 

Our results indicate that the HIV-2 leader RNA prefers to 
form loose dimers in vitro, and in accordance with 
previous reports (11,12), tight dimers were shown to 
form very inefficiently (Figure ID). However, tight dimer- 
ization had been induced by antisense oligonucleotides 
added prior to denaturation and subsequent folding of 
the HIV-2 leader RNA (8,12). As this approach might 
lead to substantial changes in the RNA fold, we added 
the oligonucleotide probes to the pre-folded HIV-2 
leader RNA at an equilibrium of 1:1 monomer to loose 
dimer. The fluorescent oligodeoxynucleotide probe 1 
promoted tight dimer formation, and stayed bound to 
both the monomer and tight dimer species of the leader 
RNA (Figure 4A). This allows us to conclude that the SL1 
structures in both the monomer and loose dimer are sus- 
ceptible to hybridization. However, the mechanism by 
which probe 1 participates in tight dimer formation was 
difficult to unravel in detail. Our data obtained for two 
shorter 2'-0-MeRNA probes 1A and IB (Figure 4B) show 
not only that both parts of probe 1 are necessary for effi- 
cient induction of tight dimer formation, but also that 
both the upper stem and apical loop of SL1 are 
involved. Furthermore, the SHAPE probing revealed 
that upon probe 1 -induced HIV-2 leader RNA tight dimer- 
ization, SL1 stem B undergoes opening (Figure 4C), ex- 
perimentally confirming the previous hypothesis (8). 
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Figure 6. Model of the HIV-2 dimerization initiation in vitro showing rearrangement in the PAL/SL1 region. Palindromic sequences within the SL1 
apical loop and PAL are indicated in blue and in red, respectively. The NCp8 interaction region is presented as ellipse in orange. 



However, it has been suggested that the promotion 
of dimerization is due to the formation of dimerization- 
competent short SL1 (+409-436) (8). Based on our data, 
we conclude that this is a consequence of PAL release. 
In other words, PAL becomes free to participate in alter- 
nate interstrand base pairing interactions that stabilize the 
tight dimer (Figure 6). Data obtained from experiments 
with shorter transcripts representing the HIV-2 leader 
RNA 3'-end confirmed the importance of PAL in SL1 
mediated tight dimerization. The presence of the entire 
PAL palindrome is necessary for the induction of RNA 
tight dimerization by oligonucleotides complementary to 
the SL1 apical loop (probes 1 and 1A). Interestingly, tight 
dimerization of the transcript lacking the 5' part of PAL 
sequence was promoted exclusively by probe IB, comple- 
mentary to the SL1 upper stem. In this case, the tight 
dimer could be formed only by the SL1 apical loop palin- 
drome. These results suggest a possible functional 
exchange between the PAL and SL1 palindromic se- 
quences in the formation of intermolecular interactions 
in vitro. It is also possible that for this transcript, desta- 
bilization of the SL1 upper stem by probe IB induces a 
conformation of the apical loop that favours tight dimer 
formation. 

The local conformational switch in the PAL/SL1 region 
in the leader RNA monomer had been previously 
proposed to regulate HIV-2 RNA dimerization (8,12,24), 
but a compatible and comprehensive model showing the 
structural rearrangements in this region was not estab- 
lished. The importance of PAL in HIV-2 dimerization is 
generally accepted, but its role is less understood 
(12,20,24), and the participation of the SL1 apical loop 
in this process is still a matter of dispute 
(10,11,13,17,20). It should be emphasized that overall, 
our data indicate that in monomeric leader RNA, PAL 
forms stem B rather than the intramolecular structures 
with nucleotides located upstream (19,24), while PAL in 
the tight dimer is involved in the intermolecular PAL/PAL 
interaction, which does not support the regulatory mech- 
anism formerly proposed (24). We have also demonstrated 
that the palindromic sequences in the SL1 apical loop and 
TAR hairpin III, participate in loose dimer formation 
in vitro. On the other hand, our observation that fluores- 
cent probe 1 is bound to the species in the tight dimer form 
(Figure 4A), supports the statement that the SL1 loop 



palindrome is not necessary in vivo for the formation of 
the dimers (20). We hypothesize that if the SL1 apical loop 
palindrome is not present, the TAR hairpin III can sub- 
stitute for its role in the loose dimer formation. These 
dimers could be held together by base pairing between 
TAR hairpins, while SL1 hairpins remain in close prox- 
imity and conducive to maturation by interaction with 
Gag/NC. It is likely that at the initiation stage of the di- 
merization process, several regions are recognized by the 
NC domain of Gag, with SL1 and TAR being the most 
likely candidates. The fact that the main binding sites for 
NCp8 map to the upper stem, but not to the apical loop of 
SL1, suggests that recognition of this motif by NC is 
possible regardless of the presence of a SL1/SL1 kissing- 
loop interaction in the loose dimer. It had been demon- 
strated that NC shows duplex destabilizing activity upon 
interaction with distorted RNA structures (46,47,52,54). 
Indeed, destabilization caused by the binding of oligo- 
nucleotides and/or NCp8 to the upper part of the SL1 
could be translated downstream to stem B leading to 
PAL release. The PAL/PAL interaction can now enhance 
the stability of the maturating HIV-2 RNA dimer. Our 
data most probably reveal only the most stable inter- 
faces within the immature dimer and we believe that 
many more RNA/RNA and RNA/protein interactions 
are involved in the dimer maturation process as previously 
suggested (6). 

In conclusion, we propose a model in which the upper 
part of SL1 serves as a binding site for the NC domain of 
Gag thus stimulating the initiation of HIV-2 dimerization, 
without necessarily being a part of the tight dimer struc- 
ture (Figure 6). Moreover, the kissing-loop formation 
between TAR hairpins III in addition to the interaction 
between SL1 apical loop palindromic sequences plays a 
role in stabilizing HIV-2 loose dimers. However, the 
SL1, once destabilized by NCp8, frees PAL for participa- 
tion in a new type of intermolecular interaction which 
appears to be involved in the formation of tight dimers. 
It remains to be seen whether the TAR kissing-loop inter- 
action is maintained throughout dimerization or is further 
transformed to a more stable dimer element. 
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